Conversation

@JiaqiWang18 (Contributor) commented Jul 18, 2025

What changes were proposed in this pull request?

Scopes DataflowGraphRegistry to the Spark Connect session by adding it as a member of the Spark Connect SessionHolder. SessionHolder is the natural home for it because pipeline executions are also scoped to that class.

Added getter/setter methods to access a session's dataflow graphs.

Added logic to drop all dataflow graphs when the session is closed.

Why are the changes needed?

Currently DataflowGraphRegistry is a singleton, but it should instead be scoped to a single SparkSession so that pipelines running on the same cluster are properly isolated from one another.

This also allows proper cleanup of pipeline resources when a session is closed.
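For illustration, here is a minimal sketch of what a per-session registry can look like. The class and method names follow the PR, but the ConcurrentHashMap-backed implementation and the simplified GraphRegistrationContext are assumptions for this sketch, not the actual Spark code:

```scala
import java.util.UUID
import java.util.concurrent.ConcurrentHashMap

// Simplified stand-in for the real GraphRegistrationContext.
case class GraphRegistrationContext(defaultCatalog: String, defaultDatabase: String)

// One instance per SessionHolder, instead of a process-wide singleton, so two
// sessions on the same cluster can never see each other's graphs.
class DataflowGraphRegistry {
  private val graphs = new ConcurrentHashMap[String, GraphRegistrationContext]()

  def createDataflowGraph(defaultCatalog: String, defaultDatabase: String): String = {
    val graphId = UUID.randomUUID().toString
    graphs.put(graphId, GraphRegistrationContext(defaultCatalog, defaultDatabase))
    graphId
  }

  def getDataflowGraph(graphId: String): Option[GraphRegistrationContext] =
    Option(graphs.get(graphId))

  // Called from session cleanup so graphs do not outlive the session.
  def dropAllDataflowGraphs(): Unit = graphs.clear()
}
```

Because the registry is owned by the session holder, dropping the session's graphs becomes a single `dropAllDataflowGraphs()` call during teardown.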

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added new test cases covering dataflow graph session isolation and proper cleanup.

Was this patch authored or co-authored using generative AI tooling?

No

@JiaqiWang18 changed the title from "[WIP] Spark 52432 session graph registry" to "[SPARK-52432][SDP][SQL] Scope DataflowGraphRegistry to Session" Jul 18, 2025
@JiaqiWang18 (Contributor, Author) commented:
@AnishMahto @sryza

@AnishMahto (Contributor) left a comment:


LGTM, just nits

Comment on lines 496 to 537
  private[connect] def createDataflowGraph(
      defaultCatalog: String,
      defaultDatabase: String,
      defaultSqlConf: Map[String, String]): String = {
    dataflowGraphRegistry.createDataflowGraph(defaultCatalog, defaultDatabase, defaultSqlConf)
  }

  /**
   * Retrieves the dataflow graph for the given graph ID.
   */
  private[connect] def getDataflowGraph(graphId: String): Option[GraphRegistrationContext] = {
    dataflowGraphRegistry.getDataflowGraph(graphId)
  }

  /**
   * Retrieves the dataflow graph for the given graph ID, throwing if not found.
   */
  private[connect] def getDataflowGraphOrThrow(graphId: String): GraphRegistrationContext = {
    dataflowGraphRegistry.getDataflowGraphOrThrow(graphId)
  }

  /**
   * Removes the dataflow graph with the given ID.
   */
  private[connect] def dropDataflowGraph(graphId: String): Unit = {
    dataflowGraphRegistry.dropDataflowGraph(graphId)
  }

  /**
   * Returns all dataflow graphs in this session.
   */
  private[connect] def getAllDataflowGraphs: Seq[GraphRegistrationContext] = {
    dataflowGraphRegistry.getAllDataflowGraphs
  }

  /**
   * Removes all dataflow graphs from this session. Called during session cleanup.
   */
  private[connect] def dropAllDataflowGraphs(): Unit = {
    dataflowGraphRegistry.dropAllDataflowGraphs()
  }

A reviewer (Contributor) commented:

Is there any particular reason why we added these delegator methods, rather than just having callers call SessionHolder.dataflowGraphRegistry.blah()?

If it's for access-modifier reasons, why not just do private[connect] lazy val dataflowGraphRegistry?
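For reference, the reviewer's alternative would look roughly like the sketch below. The registry stand-in here is simplified, and the field is left public so the sketch compiles on its own; in the real code it would carry the private[connect] modifier:

```scala
// Minimal stand-in for the per-session registry.
class DataflowGraphRegistry {
  private val graphs = scala.collection.concurrent.TrieMap.empty[String, String]

  def createDataflowGraph(): String = {
    val graphId = java.util.UUID.randomUUID().toString
    graphs.put(graphId, graphId)
    graphId
  }

  def getAllDataflowGraphs: Seq[String] = graphs.values.toSeq

  def dropAllDataflowGraphs(): Unit = graphs.clear()
}

class SessionHolder {
  // `private[connect] lazy val` in the suggested real code; callers would then
  // write sessionHolder.dataflowGraphRegistry.createDataflowGraph(...) directly
  // instead of going through one delegator method per registry operation.
  lazy val dataflowGraphRegistry = new DataflowGraphRegistry

  // Session teardown still needs only one call on the registry itself.
  def close(): Unit = dataflowGraphRegistry.dropAllDataflowGraphs()
}
```

The trade-off is fewer boilerplate methods on SessionHolder versus a slightly wider surface area exposed to callers inside the connect package.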

  def buildGraph(pythonText: String): DataflowGraph = {
    val indentedPythonText = pythonText.linesIterator.map(" " + _).mkString("\n")
    // create a unique identifier to allow identifying the session and dataflow graph
    val identifier = UUID.randomUUID().toString

nit: rename to something more descriptive like customSessionIdentifier

      throw new RuntimeException(
        s"Python process failed with exit code $exitCode. Output: ${output.mkString("\n")}")
    }
    val activateSessions = SparkConnectService.sessionManager.listActiveSessions

val activeSessions

    val dataflowGraphContexts = DataflowGraphRegistry.getAllDataflowGraphs
    // get the session holder by finding the session with the custom UUID set in the conf
    val sessionHolder = activateSessions
      .map(info => SparkConnectService.sessionManager.getIsolatedSessionIfPresent(info.key).get)

getIsolatedSession() instead of getIsolatedSessionIfPresent(...).get

Comment on lines 45 to 46
      .getIsolatedSessionIfPresent(SessionKey(defaultUserId, defaultSessionId))
      .getOrElse(throw new RuntimeException("Session not found"))

nit: just call getIsolatedSession

Comment on lines 444 to 449
    val graph1 = sessionHolder.getDataflowGraph(graphId1).getOrElse {
      fail(s"Graph with ID $graphId1 not found in session")
    }
    val graph2 = sessionHolder.getDataflowGraph(graphId2).getOrElse {
      fail(s"Graph with ID $graphId2 not found in session")
    }

nit: just call getDataflowGraphOrThrow

@sryza (Contributor) left a comment:


Nice

@sryza closed this in 0177265 Jul 21, 2025
@sryza commented Jul 21, 2025

Merged to master

haoyangeng-db pushed a commit to haoyangeng-db/apache-spark that referenced this pull request Jul 22, 2025
(The commit message repeats the pull request description above.)

Closes apache#51544 from JiaqiWang18/SPARK-52432-session-graphRegistry.

Authored-by: Jacky Wang <[email protected]>
Signed-off-by: Sandy Ryza <[email protected]>